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Introduction 

Serum  paraoxonase  (PON1)  is  a  mammalian  enzyme  that  catalyzes  the  hydrolysis  and, 
thereby,  inactivation  of  various  organophosphates  (OPs),  including  the  nerve  agents  sarin  and 
soman1.  In  recent  years  it  has  become  apparent  that  PON1,  and  its  two  known  Q  and  R 
isoenzymes,  also  play  important  roles  in  drug  metabolism  and  in  prevention  of  atherosclerosis1,2. 
PON1  is  the  best-studied  member  of  a  family  of  mammalian  enzymes  that  includes  PON2  and 
PON3,  which  share  ~60%  sequence  identity  with  PON1.  PON1  and  PON3  reside  in  the 
cholesterol-carrying  particles  HDL  (“good  cholesterol”),  whereas  PON2  is  found  in  many 
tissues.  Polymorphism  of  the  PON1  gene  affects  the  blood  levels  of  PON1  and  its  catalytic 
efficacy;  both  factors  have  a  major  impact  on  the  susceptibility  to  atherosclerosis,  and  to 
pollutants  and  insecticides1.  Mice  lacking  PON1  are  highly  susceptible  to  atherosclerosis  and  to 
OP  poisoning3.  In  vitro  assays  show  that  PON1  and  PON3  inhibit  lipid  oxidation  in  LDL  (“bad 
cholesterol”),  thus  reducing  levels  of  oxidized  lipids  involved  in  the  initiation  of 
atherosclerosis4,5.  Since  atherosclerosis  is  the  underlying  cause  of  50%  of  mortality  in  Western 
societies,  and  OPs  comprise  an  environmental  risk  as  well  as  a  terrorist  threat,  PONs  have 
become  the  subject  of  intensive  research. 

Despite  many  efforts,  the  structure  and  mechanism  of  action  of  PONs  have  remained 
enigmatic.  The  name,  paraoxonase,  is  purely  historical,  since  the  PON  family  is  a  hydrolase 
family  with  one  of  the  broadest  specificities  known.  PON1  is  a  proficient  esterase  towards 
several  synthetic  substrates,  whilst  PON2  and  PON3  exhibit  high  lactonase  activity.  But  the 
paraoxonase  activity  of  PON1  is  rather  weak,  and  PON2  and  PON3  exhibit  almost  no 
paraoxonase  activity1.  However,  all  these  activities  towards  man-made  chemicals  are 
promiscuous  activities  of  PONs  rather  then  their  primary  functions.  A  variety  of  physiological 
roles  have  been  proposed  for  PONs,  including  phospholipase  A2  action6,  degradation  of  oxidized 
lipids7),  and  hydrolysis  and  inactivation  of  homocysteine  thiolactone  -  a  known  risk  factor  for 
atherosclerotic  vascular  disease8.  The  anti-atherosclerotic  activity  of  the  PONs  is  intimately 
linked  to  their  localization  on  HDL  particles.  It  has  been  suggested  that  the  hydrophobic  N- 
terminus  of  PON1  mediates  its  anchoring  to  HDL9. 

Structural  and  functional  characterization  of  the  PONs,  and  their  engineering,  have  been 
hindered  by  lack  of  an  ample  source  of  recombinant  protein.  We  recently  described  the  directed 
evolution  of  PON1  and  PON3  variants  that  express  in  a  soluble  and  active  form  in  E.  coli,  and 
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exhibit  enzymatic  properties  identical  to  those  reported  for  PONs  purified  from  sera10.  We  report 
the  crystal  structure  of  a  recombinant  PON1  variant  derived  from  rabbit  PON1,  which  is  highly 
similar  to  human  PON1.  By  combining  directed  evolution,  site-directed  mutagenesis  and  kinetic 
studies,  we  provide  a  comprehensive  description  of  the  overall  architecture  of  PON1,  details  of 
its  active  site,  catalytic  mechanism  and  HDL-binding  mode.  We  show  how  directed  evolution  in 
the  laboratory  follows  the  footsteps  of  natural  evolution  in  swiftly  providing  new  PON  variants 
with  pre-determined  catalytic  specializations. 
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Crystallization  and  structure  determination 

Previous  attempts  to  determine  the  structure  of  PON  1  relied  on  limited  amounts  of  serum- 
purified  proteins,  resulting  in  crystallization  of  a  protein  that  co-purified  with  it11.  Human  PON1 
is  rather  unstable,  and  tends  to  aggregate  in  the  absence  of  detergents12.  These  factors  led  us  to 
directly  evolve  PONs  for  bacterial  expression  and  increased  solubility10.  Family  shuffling  of  four 
PON1  genes  (human,  rabbit,  mouse  and  rat),  and  screening  for  esterolytic  activity,  led  to 
recombinant  PON1  variants  (rePONls)  that  express  in  E.  coli.  These  variants  diverged  from 
wild-type  (wt)  rabbit  PON1  by  14-31  amino  acids  coming  from  other  PON1  genes,  and  exhibit 
enzymatic  properties  essentially  identical  to  those  of  wt  PON110.  Variants  from  the  1st  round  of 
evolution  aggregated,  and  none  crystallized.  The  2"d-generation  variants  (obtained  by  shuffling  of 
the  1st  generation  variants  and  screening  for  highest  expression  levels)  did  not  aggregate,  and  at 
least  one  (G2E6)  gave  stable  and  well  diffracting  crystals. 

RePONl-G2E6  exhibits  91%  identity  to  wt  rabbit  PON1,  with  the  vast  majority  of  variations 
deriving  from  human,  mouse,  or  rat  wt  PONl.  It  should  be  noted  that  rabbit  and  human  PONls 
are  also  highly  homologous  in  sequence  (86%)  and  function13.  Moreover,  sequence  variations 
between  rePONl-G2E6  and  rabbit  and  human  PONl  are  in  regions  that  do  not  affect  their  active 
sites  or  overall  structures.  Both  purified  native  rePONl-G2E6  and  rePONl-G2E6  containing 
selenomethionine  (SeMet-  rePONl-G2E6)  were  crystallized,  yielding  isomorphous  crystals  of 
space  group  P432,2.  The  structure  was  solved  by  single  isomorphous  replacement  anomalous 
scattering  from  data  collected  to  2.6A  on  the  SeMet  crystals  and  to  2.2  A  on  the  crystals  of  the 
native  protein14.  The  structure  shows  all  residues  except  N-terminal  residues  1-15  and  a  surface 
loop  (72-79).  Two  calcium  atoms,  a  phosphate  ion,  and  1 15  water  molecules  are  also  seen. 

The  overall  architecture  of  PONl 

PONl  is  a  6-bladed  |3-propeller,  with  each  blade  containing  4  strands  (Fig.  1).  The  ‘velcro’ 
closure  characteristic  of  this  fold15  is  supplemented  by  a  disulphide  bridge  between  Cys42  (strand 
6D)  and  Cys353  (strand  6C).  This  covalent  closure  of  the  N-  and  C-termini  is  rarely  seen  in  13- 
propellers  with  more  than  four  blades,  but  is  conserved  throughout  the  PON  family. 
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Two  calcium  ions,  7.4A  apart,  are  seen  in  the  central  tunnel  of  the  propeller:  one  at  the  top 
(Ca-1)  and  one  in  the  central  section  (Ca-2).  Ca-2  is  most  probably  a  ‘structural  calcium’  whose 
dissociation  leads  to  irreversible  denaturation16.  Ca-1  is  assigned  as  the  ‘catalytic  calcium’16.  It 
appears  to  interact  with  five  protein  residues  (the  side-chain  oxygens  of  Asn224,  Asn270, 
Asnl68,  Asp269  and  Glu53),  2.2-2.5A  away.  Two  other  potential  ligands  are  a  water  molecule, 
and  one  of  the  oxygens  of  a  phosphate  ion.  The  two  calcium  ions  exhibit  markedly  different 
affinities13.  The  ligation  of  Ca-1  is  more  extensive  than  that  of  Ca-2’s.  However,  two  of  Ca-l’s 
ligating  residues  (Asn224,  Asp269)  exhibit  distorted  dihedral  angles.  This,  and  the  higher  solvent 
accessibility  of  Ca-1,  indicate  that  Ca-2  is  the  higher  affinity  calcium. 

PONl’s  structure  resembles  that  of  Loligo  Vulgaris  DFPase17.  Both  are  6-bladed  propellers 
with  two  calcium  atoms  in  their  central  tunnel.  They  also  share  functional  homology,  since  both 
exhibit  phosphotriesterase  (PTE)  activity,  although  PON1  is  primarily  an  esterase  or  lactonase. 
However,  there  is  no  clear  sequence  homology  between  them  (BLAST  E-score  »3.6),  although 
more  sensitive  algorithms  indicate  weak  but  significant  similarity11.  Closer  inspection  reveals 
that  PON1  and  DFPase  differ  significantly  in  their  overall  architecture,  active-site  structure  and 
mechanism.  In  particular,  PON1  possesses  a  unique  addition  in  the  form  of  an  active-site  canopy 
defined  by  helices  H2  and  H3  and  the  loops  connecting  them  to  the  P-propeller  scaffold.  This 
addition  provides  PON  with  an  uncharacteristically  closed  active  site,  since  most  (3-propellers, 
including  DFPase,  exhibit  uncovered  active  sites  defined  only  by  loops  that  connect  the  (3- 
strands.  It  seems  to  play  a  critical  role  in  PONl’s  function,  both  in  defining  the  active-site 
architecture  and  sequestering  it  from  solvent,  and  in  anchoring  PON1  to  HDL. 

Although  detergent-solubilized  PON1  forms  dimers  and  higher  oligomers12,  in  the  crystal 
there  is  only  one  molecule  per  asymmetric  unit,  and  very  few  contacts  between  symmetry-related 
molecules.  It  is  possible  that  crystallization  favors  a  monomeric  form,  but  it  seems  more  likely 
that  oligomerization  of  PON1  is  a  consequence  of  its  anchoring  to  detergent  micelles  in  a  mode 
similar  to  its  anchoring  to  HDL.  Mammalian  PON1  is  glycosylated,  but  glycosylation  is  not 
essential  for  hydrolytic  activity10,18.  There  are  four  potential  N- glycosylation  sites  on  PON1 
(NX(S/T)  sites).  Two,  Asn227  and  Asn270,  are  in  the  central  tunnel  of  the  propeller,  and  are 
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largely  inaccessible  to  solvent.  Asn253  and  Asn324  are  located  on  surface  loops,  and  are  most 
probably,  as  previously  proposed18,  PONl’s  glycosylation  sites. 

Directed  evolution  of  substrate  specificity 

Site-directed  mutagenesis  is  routinely  used  to  identify  active-site  residues.  This  approach 
suffers,  however,  from  a  well-recognized  drawback:  Loss  of  activity  does  not  necessarily 
indicate  direct  involvement  of  a  particular  amino  acid  in  the  protein’s  function  since  mutations 
often  disrupt  the  overall  structure.  Indeed,  whereas  certain  residues  identified  by  site-directed 
mutagenesis  as  being  essential  for  PONl’s  activity1820  are  related  to  its  active  site,  others  are  not 
(e.g.,  Trp281).  In  contrast,  mutations  identified  following  directed  evolution  towards  a  modified 
function  are  inevitably  relevant  to  activity,  and  involve  residues  located  within,  or  in  the  vicinity 
of,  the  active  site. 

Gene- libraries  of  rePONl  were  prepared  by  random  mutagenesis  and  cloned  in  bacterial 
colonies  as  described10.  Colonies  on  agar  plates  were  screened  with  several  different  substrates 
representing  each  of  the  substrate-reaction  types  catalyzed  by  PONs:  PTE,  lactonase  and 
esterase.  For  the  latter,  both  acetate  and  octanoate  esters  were  employed,  representing, 
respectively,  ‘short  chain’  esters  towards  which  PON1  exhibits  high  activity,  and  ‘long  chain’ 
esters,  that  are  typical  substrates  of  lipases  for  which  PONs  exhibit  lower  activity.  The  best 
clones  identified  from  a  screen  of  103-104  library  clones  were  shuffled,  and  the  resulting  libraries 
screened  for  the  same  activity.  Two  to  four  rounds  of  selection  were  performed,  after  which  the 
sequences  and  catalytic  activities  of  the  selected  variants  were  determined  (Table  1). 

The  newly  evolved  variants  clearly  define  a  set  of  amino  acids  the  alteration  of  which 
dramatically  shifts  PONl’s  reactivity  and  substrate  selectivity  (Table  1).  These  shifts  involve  not 
only  an  increase  of  16-46-fold  in  activity  towards  the  substrate  for  which  each  particular  variant 
was  evolved,  but  also  a  drastic  decrease  in  activity  on  substrates  which  had  not  been  selected  for 
(6-167-fold).  Overall,  shifts  in  substrate  selectivity  of  up  to  4,600-fold  were  observed.  Thus,  for 
example,  variant  7HY  exhibits  relative  to  wt  PON1,  46-fold  higher  long-chain  esterase  activity 
and  100-fold  lower  PTE  activity.  Some  of  the  new  variants,  all  derived  from  PON1,  represent 
substrate  and  reaction  selectivities  more  similar  to  PON2  or  PON3.  For  example,  variants  2AC 
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and  1HT  exhibit  ~20-fold  higher  esterase  and  lactonase  activity  relative  to  PON1,  and  5-30-fold 
weaker  PTE  activity.  The  positions  identified  by  directed  evolution  all  appear  in  the  same  region 
at  the  top  of  the  p -propeller.  They  clearly  delineate  the  entrance  to  and  walls  of  PONl’s  active 
site  (Fig.  2). 

The  catalytic  mechanism 

At  the  very  bottom  of  the  active-site  cavity  lie  both  the  upper  calcium  (Ca-1)  and  a  phosphate 
ion,  which  was  present  in  the  mother  liquor  (Fig.  2b).  One  of  the  oxygens  of  this  phosphate  is 
only  2.2A  from  Ca-1,  and  it  may  be  bound  in  a  mode  similar  to  the  intermediates  in  the 
hydrolytic  reactions  catalyzed  by  PON.  One  of  its  negatively  charged  oxygens  that  nearest  to  Ca- 
1,  may  mimic  the  oxyanionic  moiety  of  these  intermediates  stabilized  by  the  positively-charged 
calcium.  This  type  of  ‘oxyanion  hole’  is  seen  in  secreted  phospholipase  A2  (PLA2)21,  and  has 
also  been  suggested  for  DFPase17.  Two  other  phosphate  oxygens  may  be  mimicking  the  attacking 
hydroxyl  ion  and  the  oxygen  of  the  alkoxy  or  phenoxy  leaving  groups  of  ester  and  lactone 
substrates. 

To  help  elucidate  the  mechanism  of  action  of  PON  we  determined  its  pH-rate  profile  with 
two  typical  substrates:  an  ester,  2-naphthyl  acetate  (2NA),  and  a  phosphotriester,  paraoxon.  Both 
profiles  exhibit  a  bell-shaped  curve.  The  minor  basic  shoulder  fits  an  apparent  pKa  of  9.8 
(paraoxon)  or  9.0  (2NA),  probably  reflecting  the  deprotonation  of  a  basic  side-chain  that  affects 
the  active  site  but  is  not  directly  involved  in  catalysis.  The  fully  pronounced  acidic  shoulder,  of 
apparent  pKa  ~7.1,  may  be  ascribed  to  a  His  imidazole  involved  in  a  base-catalyzed,  rate¬ 
determining  step.  In  hydrolytic  enzymes,  His  often  serves  as  a  base,  deprotonating  a  water 
molecule  and  generating  the  attacking  hydroxide  ion  that  produces  hydrolysis.  In  secreted  PLA2, 
the  attacking  hydroxide  is  generated  by  a  His-Asp  dyad,  in  which  the  imidazole  acts  as  a  base  to 
deprotonate  a  water  molecule,  and  the  Asp  carboxylate  increases  the  imidazole's  basicity  via  a 
proton-shuttle  mechanism.  The  closest  His  nitrogen  in  PLA2  is  6.3A  from  the  catalytic  calcium, 
and  two  water  molecules  are  involved:  one  attacking  the  substrate  (after  deprotonation),  and 
another  ‘catalytic  water’  that  mediates  between  the  attacking  water  and  the  His  base21.  The 
DFPase  active  site  also  contains  a  His-Glu  dyad  with  a  His  nitrogen  1.2k  from  the  catalytic 
calcium17. 
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Did  PON  1  adopt  the  same  mechanism  -  namely  a  His-Glu/Asp  dyad  acting  as  base  on  a  two- 
water-molecule  cascade?  PONl’s  active  site  contains  such  a  dyad  (Aspl83,  Hisl84).  However, 
His  184  is  ~llA  from  Ca-1,  and  the  Hisl84Asn  mutant  of  human  PON1  is  active20.  Another 
putative  dyad  is  His285-Asp269.  Yet  Asp269  ligates  Ca-1,  and  His285  is  ~8A  from  Ca-1  and 
~5A  from  the  nearest  phosphate  oxygen.  We  identified,  however,  a  His-His  dyad  near  both  Ca-1 
and  the  phosphate  ion  (Fig.  3).  We  hypothesized  that  His  115  (the  closer  nitrogen  of  which  is 
only  4.1  A  from  Ca-1)  acts  as  a  general  base  to  deprotonate  a  single  water  molecule  and  generate 
the  attacking  hydroxide,  while  His  134  acts  in  a  proton  shuttle  mechanism  to  increase  Hisll5’s 
basicity.  Interestingly,  Hisll5  adopts  distorted  dihedral  angles  -  a  phenomenon  observed  in 
catalytic  residues  of  many  enzymes.  This  assignment  for  the  catalytic  mechanism  was  supported 
by  the  Hisl  15Gln  mutation,  which  resulted  in  a  dramatic  decrease  (~2xl04  fold)  in  activity,  and 
the  Hisl34Gln,  that  produced  a  smaller,  but  substantial,  decrease  (6-150  fold). 

However,  following  a  recent  report  by  Yeung  and  coworkers22who  introduced  the  Hisll5Trp 
mutation  into  human  PON1,  that  is  highly  homologous  to  our  recombinant  PON1,  we  carried  out 
further  mechanistic  studies.  We  thus  discovered  that  our  previous  measurements  of  the 
Hisll5Gln  and  Hisl34Gln  mutants  were  biased  by  two  factors.  First,  the  Hisll5Gln  mutant 
accidentally  carried  another  mutation  (Hisl77Arg)  incorporated  during  by  PCR  amplifications  of 
the  gene.  We  found  that,  in  itself,  this  mutation  in  a  residue  far  way  from  the  active-site,  has  little 
effect  (paraoxonase  and  aryl-esterase  activities  are  70%  of  that  of  the  parental  PON1  gene).  But 
the  double  Hisll5Gln/Hisl77Arg  mutation  led  to  an  almost  complete  loss  of  activity  as  reported 
in  the  original  article.  Second,  both  mutants  (Hisll5Gln  and  Hisl34Gln)  are  unstable  and  prone 
to  loss  of  activity,  misfolding,  and  ultimately  aggregation.  We  have  now  regenerated  the  same 
mutants  (including  the  Hisll5Gln  in  isolation),  purified  them  with  the  permanent  presence  of 
detergent  (0.1%  tergitol;  in  the  lysis,  loading  and  elution  buffers)  and  10%  glycerol,  and  assayed 
them  immediately  after.  Their  activities  are  display  in  Table  2. 

The  existence  (or  even  increase)  in  the  paraoxonase  activity  of  these  mutants  suggests  that 
the  new  purification  conditions,  and  the  removal  of  the  accidental  Hisl77Arg  mutation  excluded 
the  global  effect  of  these  mutations  on  the  protein  stability  and  folding.  Furthermore,  we  have 
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since  incorporated  the  same  mutations  into  the  recombinant  PON1  variant  G2E6,  the  one  whose 
3D  structure  was  determined14,23  and  into  another  recombinant  PON1  variant,  G3C9,  with  similar 
results.  Thus  our  most  recent  data  suggest  that  Hisll5  and  Hisl34  are  not  responsible  for  the 
paraoxonase  activity  of  PON1,  although  they  appear  to  significantly  affect  its  aryl-esterase 
activity.  Thus,  the  possibility  must  be  considered  that  the  two  hydrolytic  activities  are  catalyzed 
by  different  active-site  residues,  and  that  the  His-dyad  model  postulated  (Fig.  3)  may  be  valid 
only  for  the  aryl-esterase  activity. 

Cys284,  which  is  conserved  in  all  PONs,  has  been  proposed  to  fulfill  alternative  functions  of 
PON1  related  to  atherosclerosis24.  It  is  part  of  a  highly  conserved  stretch  that  includes  active-site 
His285,  and  is  packed  against  four  highly-conserved  residues  from  the  adjacent  strands  (267, 
268,  303  and  305;  Fig.  4).  Since  it  is  buried,  it  is  unlikely  that  it  has  a  functional  role.  Its 
mutation,  however,  is  likely  to  destabilize  the  core  structure,  thereby  affecting  function 
indirectly.  Indeed,  we  found  that  Cys284  mutants  of  rePONl  are  poorly  expressed  and  relatively 
unstable. 

Anchoring  of  PON1  to  HDL 

PON1  and  PON3  are  synthesized  in  the  liver,  and  secreted  into  the  blood,  where  they 
specifically  associate  with  HDL.  HDL  is  a  particle  of  ~10  nm  diameter,  composed  primarily  of 
membrane  components  (phospholipids,  cholesterol  and  cholesterol  esters),  and  apolipoprotein  A- 
I  (apoA-I)25,  the  amphipathic  helices  of  which  are  thought  to  wrap  around  the  particle’s 
membrane-like  bilayer  in  a  belt-wise  manner26. 

PON1  retains  its  hydrophobic  N-terminus,  which  resembles  a  signal  peptide,  and  is  thought 
to  be  involved  in  anchoring  of  PON1  to  HDL9.  Most  of  the  N-terminus  is  disordered  and 
invisible  in  the  crystal  structure;,  yet  its  hydrophilic  part,  which  extends  beyond  the  signal 
peptide  (residues  19-28),  adopts  a  helical  structure  (HI).  The  entire  sequence  of  the  N-terminus 
is  compatible  with  a  transmembrane  helix,  yet  following  a  secondary  structure  prediction,  we 
modeled  only  residues  7-18  as  part  of  HI  (Fig.  5).  Helix  H2,  adjacent  to  HI,  possesses  a  clearly 
amphipathic  character.  Unexpectedly,  however,  its  hydrophobic  face  points  towards  the  solvent, 
as  do  several  residues  from  the  two  loops  that  connect  H2  to  the  propeller  scaffold.  Helices  HI 
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and  H2  form,  therefore,  two  adjacent  hydrophobic  patches  that  clearly  provide  a  potential 
membrane-binding  surface  (Fig.  5a).  The  interface  with  HDL  was  further  defined  by  a 
characteristic  ‘aromatic  belt’  rich  in  Trp  and  Tyr  side  chains,  and  a  Lys  side  chain  on  HI27. 
Notably,  the  glycosylation  sites  point  away  from  the  interface  (Fig.  5b). 

The  crystal  structure  reveals  the  overall  fold  of  the  PON  family,  and  the  details  of  the  3D 
structure  of  PON1,  and  permits  the  presentation  of  a  plausible  catalytic  mechanism.  Such  a 
mechanism,  based  on  a  His-dyad,  has  not  been  described  before,  although  its  key  elements  are 
reminiscent  of  secreted  PLA2.  Catalysis  of  both  C-0  and  P-O  hydrolysis  at  one  site  is  unusual 
but  not  unprecedented28,29.  The  structure,  the  directed  evolution  results,  the  pH-rate  profiles,  and 
previous  biochemical  data1,10  show  that  both  these  activities  indeed  take  place  at  the  same  site. 
There  is,  however,  a  possibility  that  certain  PON1  activities  (e.g.,  as  a  homocysteine 
thiolactonase8)  make  use  of  a  different  subset  of  residues  of  this  site,  including  His285,  whose 
side  chain  also  points  towards  the  center  of  the  cavity  and  to  the  phosphate  ion.  In  addition, 
nucleophilic  catalysis  by  His  115  cannot  yet  be  ruled  out,  although  there  is  currently  no  evidence 
to  support  catalysis  via  an  acyl-  or  phosphoryl-enzyme  intermediate. 

The  3D-structure  does  provide  a  hint  regarding  the  origins  of  PONl’s  remarkably  wide 
substrate  range.  Hydrophobicity  is  common  to  almost  all  its  effective  substrates.  The 
hydrophobicity  and  depth  of  PONl’s  active  site  explain  this  preference,  and  account  for  the  fact 
that  PONl’s  substrates,  whether  poor  or  effective,  exhibit  KM  values  in  the  millimolar  range,  but 
dramatically  different  kca,  values1,10.  PONl’s  multi-specificity  is,  therefore,  driven  primarily  by 
non-specific  hydrophobic  forces,  as  observed  for  other  enzymes  that  possess  deep  hydrophobic 
active  sites,  such  as  acetylcholinesterase30.  We  postulate  that  poor  substrates,  as  well  as  effective 
ones,  bind  at  the  active  site  with  similar  affinity;  yet  the  mode  of  binding  differs,  since  the  poor 
substrates  are  inadequately  positioned  relative  to  Ca-1  and  to  the  catalytic  base.  It  seems  likely 
that  the  mutations  observed  following  the  directed  evolution  process  reshape  the  active  site  walls 
and  perimeter,  thereby  improving  the  positioning  of  some  substrates  (and  of  their  respective 
catalytic  intermediates  and  transition  states)  and  worsening  that  of  others.  Reshaping  of  the 
active  site  walls  is  also  the  driving  force  behind  the  evolutionary  divergence  of  various  PON  sub¬ 
families. 
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Polymorphism  of  the  PON  gene,  and  its  effects  on  susceptibility  to  OP  poisoning  and  to 
atherosclerosis,  are  the  subject  of  intensive  research.  The  two  most  common  PON  1  forms  are 
Q/R192  and  L/M55.  The  3D  structure  of  the  recombinant  PON  crystallized  reveals  that  Lysl92 
is  part  of  the  active-site  wall.  In  human  PON1,  this  position  is  normally  arginine  (R),  but  a 
commonly  observed  polymorphism  to  glutamine  (192Q)  results  in  an  approximately  10-fold 
decrease  in  paraoxonase  activity  accompanied  by  a  higher  susceptibility  to  atherosclerosis1. 
Given  the  drastic  effects  of  changes  in  other  active-site  residues  on  substrate  selectivity  (Table 
3),  the  human  192Q  variant  may  indeed  exhibit  significantly  reduced  activities  towards  PONl’s 
physiological  substrates,  resulting  in  increased  susceptibility  to  atherosclerosis.  The  Leu55Met 
mutation  may  significantly  affect  PONl’s  stability,  and  thereby  result  in  the  lower  enzymatic 
activity  observed31.  This  may  be  ascribed  to  the  key  role  of  Leu55  in  packing  the  propeller’s 
central  tunnel,  and/or  to  the  fact  that  its  neighboring  residues  (Glu53  and  Asp54)  ligate  both  Ca-1 
and  Ca-2. 
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A  detailed  analysis  of  the  amino  acid  variations  between  rePONl  variant  G2E6 
and  wild-tvpe  Rabbit  PON1 

As  mentioned  above  10,  all  rePONl  variants  that  efficiently  express  in  E.  coli  are  chimeras  of 
four  mammalian  PONls  (Human,  Mouse,  Rat  and  Rabbit)  that  show  79-  95%  sequence  identity 
by  Clustal  W  analysis  (Fig.  6).  These  PON1  genes  were  recombined  to  create  the  shuffled  DNA 
library.  The  rePONl  variants  were  explicitly  selected  so  that  their  enzymatic  properties  remain 
unchanged  relative  to  wild  type  (wt)  PON1.  This  functional  identity  was  confirmed  with  over  a 
dozen  different  substrates,  including  esters,  phosphotriesters,  lactones,  thiolactonesi,  as  well  as 
esters  of  long-chain  carboxylic  acids.  Their  biological  activities  are  also  very  similar  to  those  of 
wt  human  and  rabbit  PON1.  Selected  variants  were  found  to  be  most  similar  to  RabPONl  (rabbit 
PON1)  with  fewer  contributions  from  the  other  three  parental  wild  type  (wt)  genes.  Variant 
G2E6  for  which  the  structure  was  solved  exhibits  91%  similarity  to  RabPONl.  Overall,  there  are 
31  amino  acid  differences  between  rePONl -G2E6  and  wt  RabPONl.  Twenty  eight  of  these 
differences  come  from  other  wt  PON1  genes,  and  only  three  are  mutations  in  the  real  sense.  As 
analyzed  below,  these  differences  are  all  in  regions  that  do  not  seem  to  affect  the  overall 
structure  of  the  enzyme  or  its  active  site. 

The  amino  acid  differences  between  rePONl -G2E6  and  wt  RabPONl  can  be  divided  to  three 
groups  (Table  4).  The  first  group  is  comprised  of  8  residues  that  originate  from  the  other  parental 
genes  (Human,  Mouse  and  Rat)  and  were  found  to  be  mostly  within  the  interior  of  the  protein 
(Fig.  7).  These  residues  are  conserved  among  all  rePONl  variants  that  efficiently  express  in  E. 
coli  (10  and  Fig.  6).  The  structure  indicates  that  several  of  these  residues  are  in  contact  with  each 
other.  For  example,  Leul30  and  Vall43  that  pack  strand  2B  against  2C,  or  Val320  (strand  5D), 
and  Leu341  and  Ile343  (strand  6B),  that  are  critically  located  at  the  propeller’s  hydrophobic  core 
between  the  fifth  and  the  sixth  blades.  Residues  of  the  first  group  most  probably  facilitate  better 
packing  of  the  hydrophobic  core,  and  prevent  misfolding  and  aggregation.  Their  absolute 
conservation  amongst  the  rePONl  variants  indicates  that  they  are  the  key  to  soluble  expression 
in  E.  coli.  Their  location,  distant  from  the  active  site,  explains  why  all  the  rePONl  variants 
exhibit  enzymatic  activity  similar  to  that  of  wt  PON1.  The  second  group  includes  amino  acid  that 
are  not  conserved  among  other  rePONl  variants  (e.g.  rePONl-G3C9),  yet  originate  from  the 
other  parental  PON1  genes  (Human,  Mouse  and  Rat).  These  amino  acids  are  located  mostly  on 
the  protein  surface  (Fig.  7).  Three  of  these  residues  (105,  107,  297)  are  involved  in 
intermolecular  contacts  in  the  crystal.  Hence  they  probably  facilitate  the  successful 
crystallization  of  rePONl -G2E6.  Most  of  the  residues  are  far  from  the  active  site,  at  the  bottom 
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face  of  the  propeller.  Only  one  residue,  Phe293,  is  part  of  PONl’s  active  site.  However,  this 
residue  anyway  varies  between  Tyr  (e.g.,  RabPONl)  or  Phe  (e.g.,  Human  PON1,  Mouse  PON1 
and  Rat  PON1;  Fig.  6)  in  various  wt  PON  Is  that  are  known  to  exhibit  almost  identical  enzymatic 
properties. 

The  third,  and  smallest,  group  includes  three  residues  that  were  mutated  during  the  shuffling 
and  amplification  processes  and  differ,  therefore,  from  all  four  parental  genes.  These  residues  are 
all  located  on  the  protein  surface  (Fig.  7),  and  include  positions  19,  123  and  260.  Position  19 
(Gly  to  Arg)  varies  between  the  different  wt  PON  Is  (Fig.  6)  that  are  known  to  exhibit  almost 
identical  enzymatic  properties.  In  addition  this  position  was  mutated  during  the  directed 
evolution  of  PON1  and  selected  (data  not  shown)  without  having  any  observable  effect  on  the 
enzymatic  activity.  The  mutation  at  position  123  (Glu  to  Asp)  is  subtle;  this  residue  is  located  on 
the  surface  and,  therefore,  is  highly  unlikely  to  have  any  effect  on  the  enzymatic  function.  The 
third  residue  (260)  was  mutated  from  Lys  to  Arg.  This  is  also  a  subtle  mutation  that,  being  on  the 
surface,  is  unlikely  to  change  the  protein’s  properties. 

In  conclusion,  it  seems  that  none  of  the  sequence  variations  seen  between  rePONl-  G2E6 
and  wt  rabbit  PON1  are  likely  to  affect  its  structure  or  function.  PON1  appears  to  be  insensitive 
to  mutations  at  many  positions,  and  can  tolerate  multiple  amino  acid  substitutions  at  its  surface 
and  core,  with  no  effect  on  its  enzymatic  function. 

Expression  and  purification  of  native  rePONl -G2E6  and  its  SeMet  derivative 

rePONl -G2E6  was  expressed  as  a  thioredoxin  (Trx)  fusion  protein,  with  an  incorporated  HIS 
Tag,  and  purified  by  chromatography  on  a  NiNTA  column  followed  by  High  Trap  Q  column 
(Pharmacia) 10.  During  attempts  to  purify  and  crystallize  it,  we  noticed  that  during  storage  the 
linker  between  the  Trx  and  rePON  1  was  spontaneously  cleaved,  and  the  crystals  were  comprised 
only  of  rePONl.  The  origins  of  this  cleavage  are  still  under  investigation.  It  was  observed  in  all 
rePONl  variants,  and  could  even  be  mediated  by  PON1  itself.  Subsequent  crystallizations  were 
set  up  with  the  cleaved  and  purified  rePONl  as  described  below.  Following  purification  of  the 
Trx  fusion  product  by  Ni-NTA  and  ion  exchange  chromatography10,  the  protein  was  incubated  at 
25°C  for  ten  days.  The  cleavage  was  monitored  by  SDS-PAGE  and  mass  spectrometry.  The 
cleaved  protein  was  concentrated  and  applied  to  HiLoad  26/60/  Superdex  200  (preparative  grade, 
Pharmacia).  Fractions  from  the  main  peak  were  analyzed  again  by  SDS-PAGE,  and  enzymatic 
activity,  pooled,  and  concentrated  to  lOmg/ml.  Sodium  azide  was  added  to  a  concentration  of 


15 


0.02%.  Mass  spectrometry  indicated  a  mass  of  40223±201Da.  N-terminal  Edman  sequencing 
gave  the  following  a  sequence:  H2NDDDKAM.  Both  types  of  data  are  consistent  with  cleavage 
of  the  linker  (expected  mass  spec  40108Da)  5  amino  acids  before  the  methionine  residue  that  is 
PONl’s  first  amino  acid. 

The  SeMet-labeled  rePONl-G2E6  was  obtained  as  follows:  a  pET  32b  plasmid  containing 
rePONl-G2E610was  freshly  transformed  into  B834(DE3)  cells  and  plated  on  LB  agar  plates 
supplemented  with  Ampicillin.  Colonies  from  three  agar  plates  were  scraped  and  rinsed  (by 
resuspension  and  centrifugation)  with  M9  salt  solution  supplemented  with  2mM  MgS04,  0.4% 
Glucose,  25  /<g/ml  FeSCk,  40  //g/ml  of  each  of  the  20  natural  amino  acid  except  L-methionine, 
40/< g/ml  of  seleno-L-methionine,  1  /4g/ml  of  vitamins  (Riboflavin,  Niacinamide,  Pyridoxine 
monohydrochloride  and  Thiamine)  and  100pg/ml  of  Ampicillin.  Typically,  1  L  of  M9  minimal 
media  with  the  above  supplements  was  inoculated  with  5  ml  of  rinsed  E.  coli  colonies  and  grown 
at  30°C  to  ODfsoonm  =  0.7.  Cultures  were  then  transferred  to  20°C  and  IPTG  added  to  0.5mM. 
Growth  was  continued  for  another  36  hours  at  20°C,  after  which  the  cells  were  harvested,  lysed 
and  purified  as  above.  Mass  spectrometry  indicated  the  incorporation  of  six  SeMets  per  rePONl 
including  the  Trx  tag.  Purification,  cleavage  from  Trx  and  isolation  of  rePONl  were  performed 
as  above. 

Structure  determination  The  structure  was  determined  for  rePONl  variant  G2E6  (for  its 
sequence  see  Fig.  4).  The  rePONl  variants  were  crystallized  by  the  microbatch  method  under 
oil32  using  a  Douglas  Instruments  robot  IMP  AX  1-5  Data  were  collected  on  beamline  ID  14-4  at 
the  European  Synchrotron  Radiation  Facility  and  processed  with  XDS33.  Three  datasets  were 
collected  for  the  SeMet  protein  at  100K  at  Se  peak  wavelength  (0.9794A)  in  order  to  increase  the 
redundancy  and  accuracy  of  the  Se  anomalous  signal  while  monitoring  the  extent  of  radiating 
damage.  A  data  set  of  the  native  crystal  was  collected  at  a  wavelength  of  0.9796A.  Data  were 
collected  on  beamline  ID  14-4  at  the  European  Synchrotron  Radiation  Facility  (ESRF)  and 
processed  with  XDS33.  Data  collection  statistics  are  given  in  Table  5. 

Although  PON1  bears  no  sequence  similarity  to  any  other  protein  sequence,  it  was  suggested 
that  it  might  have  a  6-beta  propeller  conformation.  As  a  consequence,  we  attempted  to  solve  its 
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structure  by  molecular  replacement  (MR)  using  the  structure  of  3-carboxy-cis,  cis-muconate 
lactonizing  enzyme  (PDB-code  ljof)  as  a  search  template.  A  weak,  but  still  significant,  MR  peak 
was  found  by  the  maximum  likelihood  program  PHASER34.  Although  this  did  not  result  in 
structure  solution,  it  did  help  in  selecting  the  correct  space  group  (P432,2).  Three  Se  sites  were 
located  on  the  basis  of  the  anomalous  difference,  using  SHELXD  after  local  scaling  using 
XPREP35.  SHELXE  confirmed  the  correct  space  group  and  solvent  content.  Good  experimental 
SIRAS  (single  isomorphous  replacement  anomalous  scattering)  phases  were  obtained  using  the 
program  SHARP36  while  refining  3  Se  sites  against  the  2.2A  native  and  2.6A  Se  SAD  data, 
resulting  in  an  overall  figure  of  merit  (FOM)  of  0.11/0.06  for  the  acentric/centric  reflections 
respectively.  The  isomorphous  difference  phasing  power  was  very  low  (0.22  overall)  due  to  the 
lack  of  isomorphism  between  the  native  and  SeMet  data  sets.  However,  the  anomalous  phasing 
power  for  the  SeMet  SAD  data  set  was  good  to  at  least  4A  (0.74  overall).  Phases  were  improved 
by  applying  solvent  flipping  density  modification  using  SOLOMON37,  as  directed  by  SHARP 
using  a  63%  solvent  content,  giving  an  overall  FOM  of  0.88.  An  automated  tracing  program 
ARP/wARP38,  using  native  amplitudes  to  2.2A,  coupled  with  experimental  phase  restraints, 
resulted  in  an  automatic  tracing  of  ca.  95%  of  the  chain.  Manual  model  completion  was 
performed  using  program  O39,  iterated  with  refinement  using  REFMAC40.  The  refinement  and 
model  statistics  are  listed  in  Table  5. 

Figures  were  created  with  PyMol  ('http://pymol.sourceforge.netA  and  MolScript  (Fig.  3; 
http://www.avatar.se/molscript').  Accessible  surface  area  was  calculated  by  the  program 
AREAIMOL  in  the  CCP4  package41.  Coordinates  were  deposited  in  the  PDB,  entry  lv04. 

Directed  evolution  rePONl  variant  G3C9  was  used  as  the  starting  point  for  directed 
evolution.  It  exhibits  a  sequence  almost  identical  to  those  of  wt  rabbit  PON1  and  rePONl  variant 
G2E6,  whose  3D-structure  was  determined  (91%  identity),  and  the  same  enzymatic  parameters. 
Libraries  were  generated  and  cloned  as  described10.  PTE  activity  was  screened,  and  subsequently 
quantified,  with  the  fluorogenic  OP  substrate,  7-0-diethylphosphoryl-3-cyano-7- 
hydroxycoumarin  as  described10.  Thiolactonase  activity  was  initially  screened  for  with  2NA  on 
agar  plates.  Positive  clones  were  picked  from  replica  plates  and  grown  in  96-well  plates  as 
described10.  The  crude  cell  lysates  and  purified  proteins  were  assayed  for  hydrolysis  of  y- 
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butyrothiolactone  using  5,5'-dithio-bis-2-nitrobenzoic  acid  for  detection  of  product  by 
absorbance  at  412  nm.  Esterase  activity  was  screened  with  the  fluorogenic  substrate  7- 
acetoxycoumarin  (for  short-chain  esterase  activity),  or  with  2-naphthyl  octanoate  (for  long-chain 
esterase  activity)  using  fast  red  for  the  detection  of  2-naphthol10.  Colonies  exhibiting  the  highest 
activity  were  grown  in  96-well  plates,  and  the  crude  cell  lysate  was  assayed 
spectrophotometrically  at  365  nm,  for  hydrolysis  of  7-acetoxycoumarin,  and  at  320  nm  with  2- 
naphthyl  laurate.  The  activity  of  the  variants  described  above  was  determined  with  the  same 
substrates  and  assays. 

Enzyme  kinetics  kca,  and  KM  values  were  determined  for  rePONl-G2E6  with  2NA  and 
paraoxon10,  at  pH  5.69.5.  Buffers  used  were  MES  (pH  5. 6-6. 5)  and  bis-tris  propane  (pH  6.5-9.4) 
at  0.1M,  plus  ImM  CaCl2;  the  ionic  strength  was  adjusted  to  0.2M  with  NaCl.  Kinetic 
parameters  were  obtained  from  3-5  independent  measurements  averaged  with  standard 
deviations  of  2-23%. 
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Tables 


Table  1.  New  P0N1  variants  generated  by  directed  evolution 


Variant 

Phospho- 

triesterase 

activity" 

Lactonase 

activity" 

Esterase  activity" 

short  chain 
ester 

long  chain 
esters 

rePONl 

(wt-like  activity) 

3.5*103 

1 .4*1 02 

3.0*104 

1.7*102 

Directly-evolved  variants  with  ‘specialized’  substrate 

SELECTIVITIES 

# 

7PG 

V346A 

■ns 

wmmmm 

HuHMi 

4PC 

L69V,  S193P, 
V346A 

5.7*  104 
(16.3) 

■Kem 

n.d 

1HT 

I291L,  T332A, 

G339E 

■ 

l 

n.d. 

2AC 

F292S,  V346M 
V30A,  E249K 

l 

1 

1 

1 

7HY 

F292V, 

Y293D,  I109M 

■ 

■Bn 

m 

■ 

Hh 

■ 

■KBm 

4HY 

I74L,  F292L 

K84Q,  I343M 

5.3*10* 

(0.015) 

5.9 

(0.04) 

5.2*104 

_ CLZ) _ 

6.5*103 

_ m _ 

"Activities  are  expressed  as  pmoles  of  product  released  per  minute  per  mg  enzyme.  In 
parentheses  are  the  activities  of  the  new  variants  relative  to  wt  PONl’s  activity  on  the  same 
substrate. 

*  Mutations  are  given  in  relation  to  the  sequence  of  the  wt-like  variant  G3C9.  In  bold  are 
positions  found  to  be  mutated  in  all  the  highest-activity  variants  for  a  given  substrate.  Typically, 
the  same  mutations  could  be  individually  identified  in  the  sequence  of  selected  variants  from  the 
1st  and  2nd  rounds  of  evolution,  and  appear  together  in  the  3rd  generation  variants.  Mutations  that 
appear  in  only  one  selected  variant,  but  not  in  others  selected  for  the  same  substrate,  and/or  do 
not  appear  in  the  1st  and  2nd  round  of  evolution,  are  noted  in  regular  print. 
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Table  2  New  P0N1  variants  generated  by  site-directed  mutagenesis 


Mutation 

Aryl-esterase 
(%  relative  to  wt) 

Paraoxonase 
(%  relative  to  wt) 

Hisll5Gln 

0.6% 

32% 

Hisll5Trp 

0.056% 

195% 

Hisl34Gln 

9.9% 

Table  3.  Selectivity-determining  residues  of  PONs 


Sub-family 

PON1 

PON2 

PON3 

Newly-evolved  PONls 

Position 

Residue 

Selectivity 

69 

L 

L 

L 

V/I 

PTE" 

74* 

I 

L 

M 

L/M 

lactonase/esterase 

75* 

K/M 

K/H 

P 

76* 

S 

S 

N/A 

78* 

N/D 

A 

A 

190 

Y 

F/I 

L/V/F 

192 

K/R 

K/M 

S/A/V 

193 

S 

Y/F 

F/L 

P 

PTE" 

196 

M 

M/T 

M 

222 

F 

S 

S 

240 

L 

I 

V 

291 

I 

L/V 

L 

L 

Lactonase 

292 

F 

F/Y 

L 

L/V/S 

Esterase 

293 

F/Y 

V/Y/I 

N/I 

D 

332 

T 

S 

S/T 

A 

Lactonase 

346 

V 

L/V 

IN 

A 

PTE" 

"  PTE  =  phosphotriesterase.  It  should  be  noted  that  the  PTE  activity  of  these  mutants  is  much 
higher  than  that  of  wt  PON1  which  is  the  best  PTE  of  the  three  PONs  (Table  1). 

*  Residues  74-79  belong  to  the  selectivity-determining  residues  which  differ  between  the  PON 
subfamilies  (Fig.  4)  but  are  conserved  within  them.  These  residues  are  part  of  a  mobile  loop 
(72-79)  containing  residue  74,  which  is  not  seen  in  the  electron  density  but  is  presumably  part  of 
the  active  site. 

c  Lipase-like  activity  refers  to  esters  of  long-chain  carboxylic  acids  (Table  1). 
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Table  4.  Amino  acid  changes  in  G2E6  relative  to  Rabbit  PON1 


Group 

Mutations8 

Origin 

Location  and  nature® 

I 

T126,  L130,  S138, 
V143.G301,  V320, 
L341, 1343 

wild  type 

Human,  Mouse,  Rat, 
PON1 

hydrophobic  core; 
conserved  residues 
among  all  soluble 

PON  variants. 

II 

M12,  K93,  E94,  A96, 
S98,  E101, 1103, 
N105,  L107, 1109, 
1121,  E149,  V261, 
S263,  D265,  F293, 
A296,  E297,  E313, 
D354. 

wild  type 

Human,  Mouse,  Rat, 
PON1 

protein  surface;  non- 
conserved  between 
different  soluble 

PON1  variants. 

III 

R19,  D123,R260 

protein  surface 

a  The  rePONl-G2E6  sequence  was  aligned  to  the  rabbit  PON1  protein  sequence  using  ClustaWI. 
Amino  acid  identities  are  to  the  rePONl-G2E6  sequence. 

b  Mutations  that  occurred  during  the  shuffling  process  into  an  amino  acid  that  is  different  relative 
to  all  four  wt  PON1  genes. 

c  The  precise  locations  of  these  amino  acids  on  PONl’s  structure  are  shown  in  Fig.  7. 
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Table  5  Data  collection  and  refinement  statistics. 


Data  collection 


Native 

SeMet  protein 

Wavelength  (A) 

0.9796 

0.9794 

Unit  cell  (A) 

98.44, 139.17 

98.49, 139.56 

Space  group 

P432i2 

P432i2 

Resolution  range  (A) 

20-2.2 

30-2.6 

Number  of  unique 
reflections 

35,312 

39,473 

Completeness  (%)° 

99.7  (97.9) 

97.8  (97.7) 

1/0(1)* 

12.7  (2.7) 

13.7  (4.2) 

Rsvm(I)  (%)* 

8.6  (66.1) 

10.4  (51.0) 

Refinement  and  model  statistics 


Resolution  range  (A) 

20-2.2 

Number  of  reflections 

33,505 

R-factor:  work,  free  (%) 

18.5,21.7 

Average  B-factors  (A2) 

34.76 

RMSD  from  ideal  values: 

Bond  length  (A) 

0.028 

Bond  angle  (°) 

2.02 

Dihedral  angles  (°) 

28.7 

Improper  torsion  angles  (°) 

2.06 

Estimated  coordinate  error: 

Low  resolution  cutoff  (A) 

5.0  ! 

ESD  from  Luzzati  plot  (A) 

0.32 

ESD  from  SIGMAA  (A) 

0.34 

Ramachandran  outliers 

w* 

3.9 

a  Data  for  the  outer  shell  given  in  parentheses. 

b  Ramachandran  plot  outliers  are  all  glycines  except  for  HI  15,  N224,  D269  and  H348. 


Figures 


Fig.  1.  Overall  structure  of  PON1.  (a)  A  view  of  the  6-bladed  (3-propeller  from  its  top.  The  top  of 
the  propeller  is,  by  convention,  the  face  carrying  the  loops  connecting  the  outer  p-strand  of  each 
blade  (strand  D)  with  the  inner  strand  (A)  of  the  next  blade17.  Shown  are  the  N-  and  C-termini, 
and  the  two  calcium  atoms  in  the  central  tunnel  of  the  propeller  (Ca-1  in  green,  Ca-2  in  red),  (b) 
A  side  view  of  the  propeller,  including  the  three  helices  at  the  top  of  the  propeller  (H1-H3).  N- 
terminal  residues  1-15,  and  a  surface  loop  connecting  strands  IB  and  1C  (residues  72-79)  are 
not  seen  in  the  structure. 
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(a)  (b) 


Fig.  2.  The  active  site  of  PON  1  viewed  from  above  the  propeller,  (a)  Shown  are  the  central 
tunnel  of  the  propeller  with  the  two  calcium  atoms,  and  the  side-chains  of  the  residues  found  to 
be  mutated  in  the  newly-evolved  PON1  variants  for  esterase  and  lactonase  (in  orange)  or  for  PTE 
activity  (in  yellow),  including  position  192  of  the  Q/R  (Gln/Arg)  human  polymorphism.  Shown 
in  red  is  the  putative  catalytic  His-dyad  (see  The  catalytic  mechanism  section  and  Fig.  3).  (b)  A 
surface  view  of  the  active  site.  Lys70,  Tyr71and  Phe347  are  shown  as  sticks  to  permit  a  better 
view  of  the  active  site.  At  the  deepest  point  of  the  cavity  lies  the  upper  calcium  atom  (Ca-1  in 
green)  to  which  a  phosphate  ion  (P04)  is  bound. 


Fig  3.  The  postulated  catalytic  site  and  mechanism  of  PON1.  (a)  The  catalytic  site:  the  upper 
calcium  atom  (Ca-1),  the  phosphate  ion  found  at  the  bottom  of  the  active  site,  and  the  postulated 
His-dyad.  (b)  A  schematic  representation  of  the  proposed  mechanism  of  action  of  PON1  on  ester 
substrates  such  as  phenyl  and  2-naphthylacetate.  The  first  step  involves  deprotonation  of  a  water 
molecule  by  the  His-dyad  to  generate  an  hydroxide  anion  which  attacks  the  ester  carbonyl, 
producing  an  oxyanionic,  tetrahedral  intermediate.  This  intermediate  breaks  down  (second  step) 
to  an  acetate  ion  and  either  phenol  or  2-naphthol. 
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HKALYCDL. . 

- |e- . . . 

-r---|e- . . 

[x-t--|e-  . . . 

P-IH--INVS  E^^P-sheet 
332  ~  346-7  353  r~ — 1  Helix 


Active  site  residues 
Calcium  binding  residues 
Cysteine  or  Cysteine -related  residues 
Putative  glycosylation  sites 


Fig  4.  Sequence  alignment  of  representative  members  of  the  PON  family.  Shown  are  human 
PON1,  PON2  and  PON3  (with  an  ‘H’  prefix),  C.  elegans  PON1  (CPON1),  and  rePONl  variant 
G2E6,  the  structure  of  which  is  reported  here. 
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Fig  5.  The  proposed  model  for  anchoring  of  PON1  to  the  surface  of  HDL.  (a)  Tertiary  structure 
cartoon  of  rePONl  showing  its  exposed  hydrophobic  surfaces.  N-terminal  residues  7-18, 
missing  in  the  crystal  structure,  and  predicted  to  be  helical,  were  modeled  as  part  of  HI.  Denoted 
are  all  the  hydrophobic  residues  (L,F,P,I,Y,W,V)  appearing  with  accessible  surface  area  >20A2. 
(b)  Hydrophobic  residues  proposed  to  be  involved  in  HDL-anchoring  are  shown  with  their  side- 
chains  in  yellow.  The  line  -  defined  by  the  side  chains  of  Tyrl85,  Phe  186,  Tyrl90,  Trpl94, 
Trp202  (helix  H2  and  the  adjacent  loops)  and  Lys21  (helix  HI)  -  models  the  putative  interface 
between  HDL’s  hydrophobic  interior  and  the  exterior  aqueous  phase.  The  active  site  and  the 
selectivity-determining  residues  (Table  2)  are  marked  in  blue,  and  the  proposed  glycosylation 
sites  (Asn253  and  Asn324)  in  red.13,18 
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Fig  6.  Multiple  sequence  alignment  of  human  (H),  rat,  mouse  (M)  and  rabbit  (Rab)  PON1,  and 
rePONl  variants  G2E6  and  G3C9. 
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Fig  7.  Two  surface  representations  taken  from  the  side  of  the  p -propeller  -  one  rotated  180° 
relative  to  the  other.  The  amino  acid  variations  between  rePONl  variant  G2E6  and  wt  RabPONl 
are  marked:  In  red  are  amino  acids  that  originate  from  Human,  Mouse  and  Rat  PON1  and  are 
conserved  among  all  the  rePONl  variants  expressed  in  E.  coli  (Table  1,  group  I).  In  blue  -  amino 
acids  that  originate  from  Human,  Mouse  and  Rat  PON1  which  are  not  conserved  among  soluble 
rePONl  variants  (Table  1,  group  II).  In  green  -  amino  acids  that  were  mutated  during  the  DNA 
shuffling  process  (Table  4,  group  III).  The  figure  was  created  by  PyMol 
(http://pymol.sourceforge.net/). 


Key  Research  Accomplishments 

•  6  new  PON1  variants  were  generated  by  directed  evolution: 

o  7PC-V346A 
o  4PC  -  L69V,  S193P,  V346A 
o  1HT  - 129 1L,  T332A,  G339E] 
o  2AC  -  F292S,  V346M,  V30A,  E249K 
o  7HY  -  F292V,  Y293D,  I109M, 
o  4HY  -  I74L,  F292L,  K84Q,  I343M 

•  These  new  PON1  variants  displayed  a  repertoire  of  specificities  towards  ester  and 
organophosphate  substrates,  thus  demonstrating  the  potential  to  engineer  variants  with 
beneficial  scavenger  activities. 

•  Crystals  of  RePONl-G2E6  (which  exhibit  91%  identity  to  wt  rabbit  PON1)  were  obtained  as 
were  as  well  homologous  crystals  of  the  Se-Met  derivative 

•  X-ray  data  sets  were  collected  to  for  the  native  crystals  to  2.2A  and  for  the  Se-Met  derivative 
to  2.6A  resolution. 

•  The  X-ray  data  sets  were  used  to  determine,  for  the  first  time,  the  3D  crystal  structure  of 
paraoxonase. 

•  The  3D  structure  permitted  identification  of  the  active  site  and  led  to  a  proposed  catalytic 
mechanism  of  paraoxonase 
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Conclusions 


* 

*  * 


Directed  molecular  evolution,  which  involved  shuffling  of  the  PON1  genes  of  four 
mammalian  species,  was  used  to  produce  soluble  and  catalytically  active  paraoxonase  in  E.  coli 
in  milligram  quantities.  A  number  of  different  mutants  were  produced  which  differed  in  their 
specificity  towards  ester  and  organophosphate  substrates.  These  findings  demonstrate  the 
potential  of  molecular  evolution  to  generate  PON1  variants  tailor  made  for  serving  as  nerve 
agent  scavengers. 

One  new  variant  was  crystallized  and  its  crystal  structure  subsequently  determined.  The  3D 
structure  was  revealed  to  be  a  6-blade  ^-propeller.  The  active  was  found  to  be  within  the  central 
cavity  of  the  propeller  and  a  plausible  reaction  mechanism  was  suggested. 
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